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TETRACYCLINE REPRESSOR -MEDIATED BINARY 

REGULATION SYSTEM FOR CONTROL OF 
GENE EXPRESSION IN TRANSGENIC ANIMALS 

5 1. ItfTROPVCTION 

The present invention relates to a tetracycline 
repressor-mediated binary regulation system for the 
control of gene expression in transgenic animals- It 
is based, at least in part, on the discovery that, in 

10 a non-human transgenic animal that carries a first 
transgene under the control of a modified promoter 
comprising a tetR operator sequence and a second 
transgene encoding the tetR repressor protein, 
expression of the first transgene may be efficiently 

15 induced by administering tetracycline to the animal. 

2. BACKGROUND OF THE INVENTION 

2.1. CONTROL OF GENE EXPRESSION 
XN TRANSGENIC ANIMALS 

20 The production of transgenic animals for both 

experiment and agricultural purposes is now well known 
(Wilmut et al., 7 July 1988, New Scientist pp. 56-59). 
In research , transgenic animals are a powerful tool 
that have made significant contributions to our 

25 understanding of many aspects of biology and have 
contributed to the development of animal models for 
human diseases (Jaenisch, 1988, Science 240 : 1468- 
1474). It is also clear that several livestock 
species can be made transgenic and these species 

30 promise to expand and revolutionize the method of 
production and diversity of pharmaceutical products 
available in the future, in addition to improving the 
agricultural qualities of the livestock species 
(Wilmut et al., supra ) . 

35 a critical, often neglected, aspect of developing 

transgenic animals is the process whereby expression 
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of the newly introduced gene, referred to as the 
transgene, is controlled. This is an important 
process since stringent regulation of transgene 
5 expression is often important both for practical, 
regulatory and safety reasons and to maintain the 
health of the transgenic animal. In the past either 
"inducible" or "tissue specific" regulatory mechanisms 
have been used. Inducible regulation is defined 

10 herein as a method of gene regulation which allows for 
some form of outside manipulation of the onset and/or 
level of transgene expression. Tissue specific 
regulation is defined herein as a method for targeting 
transgene expression to particular tissues or organs. 

15 Inducible gene regulation may be achieved using 

relatively simple promoter systems such as the 
metallothionein heat shock promoters, or by using 
promoters which are responsive to specific compounds 
such as the Mouse mammary tumor virus LTR which is 

20 responsive to glucocorticoid stimulation. More 

flexible, though more complex inducible regulation 
systems can be achieved through a "binary" gene 
approach which utilizes a transactivator gene product 
to control expression of a second gene of interest. 

25 Tissue specific gene regulation usually consists of 
simple single gene methods (Byrne et al., 1989, Proc. 
Natl. Acad. Sci. U.S.A. 86:5473-5477; Ornitz et al., 
1991, Proc. Natl. Acad. Sci. U.S.A. 88:698-702), 
although binary transactivator systems can also 

3 0 provide a high degree of tissue specificity. 

These current systems provide only a limited 
ability to control the time of transgene expression 
within individual animals. In this respect tissue 
specific promoter elements provide no method to 

3 5 control the onset of transgene activity, but function 
merely to target gene expression to defined sites. 
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Simple inducible promoters such as metallothionein 
generally lack tissue specificity and usually have 
some aspect of endogenous basal expression which 
5 cannot be controlled. Thus even for the extensively 
used inducible metallothionein promoter this approach 
at best only permits selection of the time at which a 
relative increase in transgene expression can be 
induced . 

10 Binary transactivation systems typically consist 

of two transgenic animals. One animal contains the 
gene of interest controlled by a promoter element that 
requires a specific transactivator gene product for 
expression. Thus, the gene of interest is not 

15 expressed in the absence of the transactivator. A 

second transgenic animal is then made which expresses 
the required transactivator in the desired tissue. By 
mating these two transgenic animals, offspring 
containing both the gene of interest and the 

20 transactivator transgene can be produced. Only in 
these doubly transgenic animals is the gene of 
interest expressed. Since expression of the gene of 
interest requires the transactivator, this binary 
approach dramatically reduces or eliminates any 

25 undesirable basal expression inherent in simple 

inducible systems. Additionally, if expression of the 
transactivator is targeted using a tissue specific 
promoter, then in the double transgenics, expression 
of the gene of interest is in effect targeted to the 

30 same specific tissue. Binary systems provide 
therefore a low resolution method of temporal 
regulation in as much as they allow the determination 
of which generation of animals will express the gene 
of interest. These systems provide little ability, 

35 however, to control the time and level of gene 

expression within an individual transgenic animal. 
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For many applications it is necessary to 
accurately control the time and pattern of transgene 
expression within an individual transgenic animal. 
5 For example, many attempts have been made to produce 
transgenic pigs which express increased levels of 
growth hormone (Vize et al., 1988, J, Cell Sci. 
90i:295-300;; Pinkert et al., 1990, Dom. Animal 
Endocrinol. 7:1-18). Elevated growth hormone levels 

10 dramatically decrease the amount of body fat in pigs, 
and increase the animals overall feed efficiency. 
These effects would be beneficial, both to the 
consumer who could purchase a leaner, healthier 
product, and to the producer who can profit from 

15 having a more efficient animal. To date however, all 
attempts to increase the level of growth hormone 
through production of transgenic pigs have also 
produced serious pathological conditions which greatly 
reduce the health of the animals. These pathologies 

2o are the direct result of uncontrolled, constitutive 

expression of growth hormone, since many studies using 
exogenous hormone administration for short periods of 
time have not produced pathologies, while still 
benefiting feed efficiency and fat content. In this 

25 situation, a regulatory method to control onset and 
level of expression from a growth hormone transgene 
would be extremely useful. 

2.2. REPRESSOR-MEDIATED GENE CONTROL 
30 Transcriptional repressors are usually allosteric 

DNA binding proteins with at least two functional 
sites. One site on the protein is used to bind DNA. 
The DNA binding site binds to a defined DNA sequence 
which is known as the operator site. Operator sites 
35 usually consist of palindromic sequences of 12 or more 
base pairs. A gene which is regulated by a repressor 
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repressor causes repression of mammalian promoters 
through two basic mechanisms. If the operators are 
located downstream of the transcription start site, 
5 Lacl appears to block expression by inhibiting mRNA 
elongation. That is to say, the Lacl repressor blocks 
the progress of RNA polymerase by steric interference. 
When operator sequences are located in other 
positions, Lacl seems to inhibit protein-protein 

10 interactions between the cellular factors normally 
involved in transcription initiation. 

Gatz and Quail (1988, Proc. Natl. Acad. Sci. 
U.S.A. 85:1394-1397) have demonstrated tetR function 
in a plant protoplast culture system. Plant 

15 protoplasts were transfected with a tetR gene 

expressed from a cauliflower mosaic virus (CAMV) 
promoter along with a CAT reporter gene, regulated by 
a modified CAMV promoter. In contrast to the results 
with Lacl, Gatz and Quail showed that tetR operators 

2o positioned between the transcription start site and 

the first codon of the CAT mRNA were not responsive to 
tetR repression. Therefore the tetR protein does not 
appear to be able to block the procession of RNA 
polymerase. Effective repression by tetR was only 

25 observed when the operator sequence was positioned 

such that the CAMV TATA-box element was flanked by the 
two 19bp palindromes of the tetR operator. With this 
modification, effective repression of the reporter 
gene, and induction with tetracycline could be 

3 0 achieved. This suggests that repression by tetR 

specifically inhibits the initiation of transcription, 
in this case apparently by blocking the binding of the 
TATA-box binding factors. 

Recently the tetR system has been shown to 

35 function in transgenic plants. Gatz et al. (1991, 
Mol. Gen. Genet. 227:229-237) have introduced their 
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original tetR responsive CAMV promoter, in which the 
operator sites flank the TATA-box into transgenic 
tobacco plants. Unexpectedly, this promoter, which 
5 exhibited very good regulation in tissue culture 
assays was not very effective in regulating gene 
expression in transgenic plants. Instead they found 
that effective repression and induction in transgenic 
plants occurred when the operator sites were 
10 positioned just downstream of the normal transcription 
start site. 

3. SUMMARY OF THE INVENTION 
The present invention relates to a tetracycline 

15 repressor-mediated binary regulation system for the 
control of gene expression in non-human transgenic 
animals. It is based, at least in part, on the 
discovery that in transgenic mice carrying two 
transgenes, the first encoding bovine growth hormone 

20 (bGH) under the control of a PEPCK promoter modified 
to comprise the tetR operator sequence at the Nhel 
site, and the second encoding tetR repressor protein 
under the control of an unmodified PEPCK promoter, 
expression of bGH could be efficiently and selectively 

25 induced by administering tetracycline to the 
transgenic mice. 

In particular embodiments, the present invention 
provides for (i) animal promoter elements modified to 
comprise a tetR operator sequence; (ii) nucleic acid 

3 0 molecules comprising a gene of interest under the 

control of such a modified promoter; (iii) non-human 
transgenic animals that carry a transgene under the 
control of said modified promoter and/or a transgene 
encoding the tetR repressor protein; and (iv) a method 

35 of selectively inducing the expression of a gene of 
interest in a non-human transgenic animal comprising 
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administering tetracycline to a non-human transgenic 
animal that carries a first transgene, which is the 
gene of interest under the control of a promoter 
5 modified to comprise a tetR operator sequence and a 
second transgene encoding the tetR repressor protein. 

The present invention offers the advantage that, 
in the absence of tetracycline, expression of the gene 
of interest occurs at only very low levels due to 

10 efficient repression by tetR. In preferred, non- 
limiting embodiments of the invention, repression by 
tetR is further enhanced by utilizing a synthetic tetR 
gene which is devoid of splice signals and has 
optimized codon usage for mammalian cells. 

15 Accordingly, the present invention allows tight 

control of gene expression in transgenic animals by 
withholding or administering tetracycline. 

4. DESCRIPTION OF THE FIGURES 

20 Figure 1. A. Nucleotide sequence of tetR operator 

as it occurs in TnlO, and in the oligonucleotides 
used to produce the modified PEPCK promoter 
elements. Bold face lettering represent the OP1 
and OP2 tetR binding sites. The general purpose 

25 oligonucleotide is the sequence from pdd7. The 

flanking EcoRI and AccI restriction sites used to 
excise this operator sequence are indicated. 
Additional restriction sites present in the 
plasmid, but not indicated here, which can be 

3 0 used to excise the operator include PstI, BamHI , 

Spel, Sbal, NotI, EagI, SacII, BstXI, and SacI on 
the 5 1 side and Xhol, Apal and Kpnl on the 3 1 
side. The sequence of the PEPCK-TATA box 
operator is also indicated (see methods) . 

35 Figure 1. B. Nucleotide sequence of the ddl 
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operator. Lower case letters correspond to 
polylinker sequence. The 5 1 EcoRI and 3' AccI 
restriction sites used for producing the modified 
5 PEPCK promoters (Pck_ A and Pck-N) are indicated. 

The 10 base pair linker beween 0P1 and OP2 is 
underlined. Additional polylinker restriction 
sites available in pd37 include PstI, BamHI, 
Spel, Xbal, NotI, EagI, SacII / BstXI,and SacI on 
10 the 5' side and Xhol, Apal and Kpnl on the 3' 

side. 

Figure 2. A representation of the three modified 

PEPCK promoter elements. Construct 251 has the 
337 operator sequence integrated in the AccI site 

15 of PEPCK, just 5* of the TATA-box control 

element. Construct 252 has the 337 operator 
sequence incorporated into the Nhel site of 
PEPCK , just 3' of the TATA-box element. 
Construct 261 incorporates the TATA-specif ic 

20 operator sequence which is integrated between the 

5' AccI site and the 3 1 Nhel sites. 
Figure 3. Structure of the modified PEPCK controlled 
bovine growth hormone genes. The Pck_AbGH and 
Pck_NbGH genes differ only in the site of 

25 operator insertion. For PckJkbGH the operator is 

inserted at the AccI site 5' of the PEPCK TATA- 
box element. For Pck_NbGH the operator is 
inserted into the Nhel site 3 1 of the TATA-box 
element (pPCK_NbGH has been deposited with the 

30 ATCC and assigned accession No: ) . In the 

PckJTbGH gene, a TATA-box specific 
oligonucleotide was used, and this sequence was 
inserted between both the AccI and Nhel sites. 
A. Indicated the probe used for Si hybridization. 

35 Figure 4. Si Nuclease protection assay to map the 5' 
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start site of bGH from the Pck_N promoter. Total 
liver RNA (lO^g) was hybridized to a 280 bp 5' 
labelled probe from the Pck_NbGH gene in 4 0mM 
5 PIPES (Ph6.4), IMm EDTA, 400mM NaCl, 80% 

formamide at 55° overnight. The probe spanned 
from the Hinfl site in the 5' untranslated leader 
sequence of bGH to the PvuII site 5 1 of the TATA- 
17 box. The probe includes the tet-operator 

10 sequence of Pck_N (see Figure 3). After 

hybridization 300 pi of ice cold digestion buffer 
(280mM NacL, 50Mm SODIUM ACETATE (Ph4.5), 4 . 5Mm 
ZnS0 4/ 20Mg/ml carrier DNA and 500 units SI 
nuclease) was added and incubated at 37° for 30 

15 minutes. The reaction as stopped by adding 80^1 

of Stop Buffer (4M Ammonium acetate, 50mM EDTA 
and 50/ig/ial tRNA) , extracted with 
phenol/chloroform, precipitated with ethanol and 
analyzed on a 6% sequencing gel. The arrow 

20 indicates the protected fragment. Initiation of 

bGH mRNA from the modified Pck_N promoter occurs 
approximately 20 bp 3 • of the TATA-box. This 
initiation site places the start of the message 
just prior to the first tetR binding site. This 

25 result indicates that the bGH mRNA starts from a 

single cap site, and suggests that tetR 
repression is due to a block in transcription 
initiation. Furthermore, unrepressed bGH 
expression appears to be due to limited tetR 

3 0 expression. 

Figure 5. Nucleotide sequence of the tetR repressor 

protein gene. 
Figure 6. Alterative, nonlimiting promoters of 

interest. Asterisks indicate sites at which tetR 

35 operator sequence may be inserted. 

Figure 7. Northern blot analysis of bGH mRNA in liver 



of Fl generation animals. 

Figure 8. Northern blot analysis of bGH mRNA 
expression in four transgenic lines. 

Figure 9A. Tissue specificity of bGH expression in 
Line 10-2 in the presence of 50 tig/ml 
tetracycline. Northern blot analysis of bGH 
induction in a variety of tissues . Only the 
liver and kidney show significant expression. 

Figure 9B. Tetracycline induction of bGH in Line 10-2. 
Both liver and kidney, which are the only sites 
for bGH expression in Figure 9A, also show 
tetracycline dependent bGH expression. 

Figure 10. 345 Repressor Construct. 

Figure 11. Induction of bGH expression in Construct 
345 Offspring. Northern blot analysis of liver 
RNA from Fl animals containing the 345 construct. 
Only animals from line 14 exhibit tetracycline 
dependent bGH expression. 

Figure 12. Expression and alternative processing of 
tetR transgene. A RNase protection probe which 
extends from the Nrul site of tetR 3 1 to the end 
of the gene was used. This probe includes only 
tetR coding sequences and should give a fully 
protected fragment of approximately 4 00 base 
pairs. A protected fragment of approximately 
220-260 base pairs is observed, which is far 
smaller then predicted. 

Figure 13. 5' Structure of tetR mRNA. Liver RNA was 
treated with reverse transcriptase and amplified 
by PCR. The RNA was amplified using two 
different pairs of primers. The first primer 
pair (TZ-1 and TZ-4) should produce a 619 base 
pair product. The second primer pair (T203 and 
TZ04) should produce a 498 base pair product. 
The sequence of the primers are: 
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TZ-1 : 5 1 CCGC ATATGATCAATTCAAGGCCGAATAAG 3 1 
TZ-3 : 5 • CTTTAG CG ACTTG ATG CT CTTG ATCTT C C A 3 • 
TZ-4 : 5 1 AATTCGCCAGCCATGCCAAAAAAGAAGAGG3 » 
The TZ-4 primer is common to both primer pairs 
and is the 5 1 primer which encompasses the start 
codon of the tetR and mRNA. Primer TZ-1 and TZ-3 
are two different 3 • primers both of which are in 
the tetR coding region. When amplified, these 
primer pairs produced smaller then expected 
products (approx. 215bp vs. 619bp for TZ-4 and 
TZ-1, and approx. 94bp vs. 498bp for TZ-4 and 
TZ-3) . The products of this reaction were cloned 
and sequenced. Sequencing revealed the presence 
of an unexpected intron which spanned from near 
the Xbal site at the start of tetR to a splice 
acceptor just 8 base pairs 5 1 of the TZ-3 primer. 
Figure 14. Composition analysis of Wild Type TnlO 
tetR gene. The TnlO tetR coding sequence was 
analyzed on a desktop computer using Mac Vector 
software. The figure shows a diagram of the tetR 
coding region with the plus strand splice doner 
(D) and splice acceptor (A) signal sequences 
indicated. For reference the location of the 
Xbal restriction is also indicated. The first 
graph depicts the percentage of G and C bases in 
the coding region of tetR. There are several 
domains of very low GC content. The bottom graph 
is an analysis of codon bias. The dark line is a 
comparison of the tetR codon usage to a mouse 
codon bias table. Values lower than 1.0 are 
indicative of sequences which may translate 
poorly. For reference, a comparison of tetR to a 
Tobacco codon bias table is included (light 
line) . In transgenic tobacco, the tetR 
regulation system functions very efficiently, 
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suggesting that for this gene, codon bias may be 
an important factor for efficient expression. 
Figure 15, Synthetic tetR Component Sequences. The 
5 components of the synthetic tetR gene were 

synthesized by Midland Laboratories as four 
overlapping double stranded DNA cassettes. The 
sequence of these cassettes are shown. Each 
cassette was blunt cloned into the Hinc2 site of 

10 pUCl9 and sequenced to verify authenticity. The 

resulting plasmids pLTl, pLT2, pLT3 and pLT5 can 
be used as the source material to assemble the 
entire synthetic tetR coding sequence since each 
contains an overlapping unique restriction site 

15 (bold face) through which they can be joined. 

Figure 16. Sequence of Synthetic tetR gene. 
Figure 17. Composition analysis of synthetic tetR. 
These graphs were produced using the same 
software described in Figure 15. The figure 

20 depicts the structure of the synchetic tetR gene, 

now devoid of splice donor signal sequences, with 
only a single splice acceptor signal remaining 
(A) . This is not the splice acceptor which was 
active in the 345 construct. The percentage of G 

25 and C bases has been significantly improved , 

while the frequency of CpG base pairs has been 
kept to a minimum. A CpG base pair is frequently 
the site for DNA methylation, which can 
negatively effect the expression of a gene. The 

3 0 codon bias of the synthetic tetR gene is also 

vastly improved. The graph depicts the results 
when the synthetic tetR coding sequence is 
compared to the same mouse codon bias table used 
previously. 
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5. DETAILED DESCRIPTION OF THE INVENTION 
For purposes of clarity of description, and not 
by way of limitation, the detailed description of the 
invention is divided into the following subsections: 

(i) the tetR operator; 

(ii) modified promoters containing the tetR 
operator; and 

(iii) utility of the invention. 



5.1. THE TETR OPERATOR 
In order to practice the instant invention, the 
tetR operator sequence is inserted into a suitable 
animal promoter sequence in order to render that 
15 promoter subject to control by tetR repressor protein. 
A diagram of the tetR operator sequence is depicted in 
Figure 1. 

It may be convenient to clone the tetR operator 
into a vector, such as a plasmid or a phage, to 
2o facilitate its propagation. Cloned operator sequence 
may then be rendered available for insertion into a 
promoter of interest, as set forth in Section 5.2., 
infra . 

In a particular, nonlimiting embodiment of the 
25 invention, tetR operator sequence may be cloned as 

follows: Four oligonucleotides, which when annealed 

produce the two 19bp OP1 and OP2 palindromic sequences 

of the tetR operator may be synthesized; the sequences 

of said oligonucleotides are as follows: 
30 X- 1 . 5 1 ACTCTATCATTGATAGAGT3 1 

X-2 . 5 1 ACTCT ATCAATG ATAG AGT 3 1 

X-3 . 5 1 TCCCTATCAGTGATAGAGA3 f 

X-4 . 5 1 TCTCTATCACTGATAGGGA3 1 

Oligonucleotides X-i and X-2 are complementary and, 
35 when annealed, form the OP1 operator. Similarly, 

oligonucleotides X-3 and X-4 , when annealed, produce 
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the OP2 operator site. The OP1 oligonucleotides may 
then be directly cloned into the EcoRV site of the 
Bluescript (Stratagene) polylinker to form plasmid X. 
5 OP2 oligonucleotides may then be cloned into a Mung 
bean nuclease blunted Clal site of plasmid X to form 
plasmid Y . The resulting tetR operator may then be 
propagated and then excised from plasmid Y as an 
EcoRI, AccI fragment which may be end-filled with T4 
10 polymerase and gel purified • 

It is preferable that the separation between OP1 
and OP2 is about 10-11 bp. 

Analogous methods may be used to insert the tetR 
operator site into other suitable vectors* 

15 

5.2. MODIFIED PROMOTERS CONTAINING 
THE tetR OPERATOR 

According to the invention , the tetR operator may 
be inserted into a suitable animal promoter so as to 

20 render that promoter subject to repression by tetR 
repressor protein. Any animal promoter maybe used; 
strategies for promoter selection are set forth in 
Section 5.3. . infra . 

In preferred embodiments of the invention, the 

25 tetR operator sequence is positioned 3 1 to the TATA- 
box sequence. A nonlimiting list of promoters which 
may be used according to the invention is set forth in 
Figure 6, together with the proximal portion of the 
promoter in the vicinity of the TATA-box, which is 

30 underlined. 

In a specific, nonlimiting embodiment of the 
invention, the tetR operator site may be inserted into 
the Nhel site of the PEPCK promoter (Wynshaw-Boris et 
al., 1984, J. Biol. Chem. 251:12161-12169). A diagram 

35 of the PEPCK promoter containing the tetR operator 
sequence of the Nhel site is presented in Figure 2. 
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For insertion of the operator sequence, the PEPCK 
promoter may be cut with Nhel and end-filled with T4 
polymerase; tetR operator, prepared as set forth in 
5 Section 5.1., supra , may then be blunt-ligated into 
place. 

5.3. UTILITY OF THE INVENTION 
5.3.1. STRATEGY 
10 The strategy of the invention is to prepare a 

non-human transgenic animal that comprises two 
transgenes. The first transgene, termed "A, M is a 
gene of interest, the expression of which is desirably 
controlled. Virtually any gene of interest may be 
15 used, including, but not limited to, growth hormone, 
hemoglobin, low density lipoprotein receptor, insulin, 
genes set forth in Table I, etc. 

TABLE 1 
Other Genes Of Interest 
Gene Disease/Affect 
ADA Adenosine deaminase Immuno-def iciency 

TNF Tumor necrosis factor Anti-cancer 
IL-2 Interleukin-2 Anti-cancer 
LDL low density hypercholesterolemia 
Factor IX hemophelia 
Factor VIII hemophelia 
/J-glucosidase Gauchers disease 

CFTR Cystic fibrosis Cystic fibrosis 

transmembrane regulator 
HPRT Hypoxanthine-guanine Lesch-Nyhan syndrome 

phosphor ibosyltransf erase 
UDP-glucuronyl transferase Crigler-Naj jar syndrome 
Growth Hormone receptor Growth 
Insulin-like growth factor Growth 
Growth hormone releasing Growth 
factor 
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The expression of gene M A" is under the 
transcriptional control of promoter H B". Promoter B 
comprises a tetR operator sequence , as discussed 
5 supra . Promoter B desirably defines the time and 
tissue window in which the transgene may be induced; 
for example, promoter A may be a tissue specific 
promoter such as the PEPCK promoter (which is 
expressed selectively in liver and becomes active 

10 shortly prior to birth) . The second transgene encodes 
the tetR repressor, the sequence of which is set forth 
in Figure 5. 

Analysis of the TnlO tetR coding sequence 
indicates that the codon usage for this gene is poorly 

15 suited for expression in mammalian cells (FIG. 15) . 
To optimize tetR expression in mammalian cells a new 
tetR repressor gene was designed ( See , Section 7, 
infra ) , which may be utilized in alternative 
embodiments of the invention. The synthetic tetR gene 

2 0 (syn-tetR) is designed to encode exactly the same 
protein product as the bacterial TnlO tetR gene but 
optimizes codon usage for mammalian cells. The 
percentage of G and C bases has been significantly 
improved, while the frequency of CpG base pairs has 

25 been minimized. A CpG base pair is frequently the 
site for DNA methylation which can negatively affect 
the expression of a gene. In addition, the syn-tetR 
gene is devoid of any splice signals, decreasing the 
likelihood of aberrant splicing of the RNA which may 

30 result in production of a non-functional message. The 
sequence of the synthetic tetR gene is depicted in 
Figure 16. Plasmids comprising these sequences may be 
constructed using plasmids pLT-1, pLT-2, pLT-3 and 
pLT-5 (deposited with the American Type, Culture 

35 Collection (ATCC) and assigned accession numbers 
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and , as described in 



Section 7, infra / 

In further embodiments, the present invention 
5 provides for additional synthetic tetR genes from 
which one or more splice sites have been deleted or 
for which codon usage has been further optimized. 

The present invention covers synthetic tetR genes 
having the sequence set forth in Figure 16 and for 

10 functionally equivalent variants of that sequence* 
In specific, non-limiting embodiments of the 
invention, a nuclear localization signal may be added 
to a natural or synthetic tetR gene to facilitate its 
expression ( See . Section 7, infra) . 

15 Expression of tetR is controlled by promoter M C". 

While it is preferable that promoter C be the same as 
promoter B except that promoter C does not contain a 
tetR operator sequence, any promoter which provides 
expression of tetR so as to repress expression of gene 

20 "A" during the period when it is desirable to repress 
expression of "A" may be used. 

For example, and not by way of limitation, a 
transgenic animal may be produced which carries a 
first transgene which is bovine growth hormone under 

25 the control of a PEPCK promoter modified to contain a 
tetR operator sequence at the Nhel site and a second 
transgene which is tetR repressor protein under the 
control of an unmodified PEPCK promoter; see Section 
6, infra . The pPCK_NbGH construct has been deposited 

30 with the ATCC and assigned accession number . 

5.3.2. TRANSGENIC ANIMALS OF 
THE INVENTION 

The binary repressor system of the invention may 

be used to control gene expression in any non-human 

35 

transgenic animal, including, but not limited to, 
transgenic mice, pigs, goats, cows, rabbits, sheep, 
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etc. The present invention provides for such non- 
human transgenic animals earring as transgenes nucleic 
acid constructs described herein, including natural or 
5 synthetic tetR repressor proteins and operator 
sequences. 

Transgenes may be introduced by microinjection, 
transf ection, transduction, electroporation, cell gun, 
embryonic stem cell fusion, or any other method known 
X0 in the art. The transgenes of the invention may be 
co-introduced into a single animal or may be 
introduced into two individual animals that are 
subsequently mated to produce doubly transgenic 
offspring. 

X5 For example , for the production of transgenic 

mice, the following general protocol may be used. 
Male and female mice are mated at midnight. Twelve 
hours later, the female may be sacrificed and the 
fertilized eggs may be removed from the uterine tubes. 

2o Foreign DNA may then be microinjected (100-1000 
molecules per egg) into a pronucleus. Shortly 
thereafter, fusion of the pronuclei (a pronucleus or 
the male pronucleus) occurs, and, in some cases, 
foreign DNA inserts into (usually) one chromosome of 

25 the fertilized egg or zygote. The zygote may then be 
implanted into a pseudo-pregnant female mouse 
(previously mated with a vasectomized male) where the 
embryo develops for the full gestation period of 20-21 
days. The surrogate mother then delivers the mice and 

30 by four weeks transgenic pups may be weaned from the 
mother . 

According to another embodiment of the invention, 
a transgenic pig may be produced, briefly, as follows. 
Estrus may be synchronized in sexually mature gilts 
35 (>7 months of age) by feeding an orally active 

progestogen (e.g. allyl trenbolone, AT: 15mg/gilt/day) 
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for 12 to 14 days. On the last day of AT feeding all 
gilts may be given an intramuscular injection of 
prostaglandin F^ (Lutalyse: 10mg/ injection) at 0800 
5 and 1600 hours. Twenty-four hours after the last day 
of AT consumption all donor gilts may be administered 
a single intramuscular injection of pregnant mare 
serum gonadotrophin (1500 U) . Human chorionic 
gonadotrophin (750 IU) may be administered to all 

10 donors at 80 hours after pregnant mare serum 
gonadotrophin . 

Following AT withdrawal, donor and recipient 
gilts may be checked twice daily for signs of estrus 
using a mature boar. Donors which exhibited estrus 

X5 within 3 6 hours following human chorionic 

gonadotrophin administration may be bred at 12 and 24 
hours after the onset of estrus using artificial and 
natural (respectively) insemination. 

Between 59 and 66 hours after the administration 

20 of HCG one- and two-cell ova may be surgically 
recovered from bred donors using the following 
procedure. General anesthesia may be induced by 
administering 0.5 mg of acepromazine/kg of bodyweight 
and 1.3 mg of ketamine/kg via a peripheral ear vein. 

25 Following anesthetization, the reproductive tract may 
be exteriorized following a mid-ventral laparotomy. A 
drawn glass cannula (O.D. 5 mm, length 8 cm) may be 
inserted into the ostium of the oviduct and anchored 
to the infundibulum using a single silk (2-0) suture. 

3 0 Ova may then be flushed in retrograde fashion by 

inserting a 20g needle into the lumen of the oviduct 2 
cm anterior to the uterotubal junction. Sterile 
Dulbecco's phosphate buffered saline (PBS) 
supplemented with 0.4% bovine serum albumin (BSA) may 

35 be infused into the oviduct and flushed toward the 
glass cannula. The medium may be collected into 
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sterile 17 x 100 mm polystyrene tubes. Flushings may 
be transferred to 10 x 60 mm petri dishes and searched 
at a lower power (5 Ox) using a Wild M3 
5 stereomicroscope. All one- and two- cell ova may be 
washed twice in Brinster's Modified Ova Culture -3 
medium (BMOC -3) supplemented with 1.5% BSA and 
transferred to 50 ill drops of BM003 medium under oil. 
Ova may be stored at 38 °C under a 90% N 2 , 5% 0 2 , 5% Co 2 

i0 atmosphere until microinjection is performed. One and 
two-cell ova may be placed in an Eppendorf tube (15 
ova per tube) containing 1 ml HEPES medium 
supplemented wit 1.5% BSA and centrifuged for 6 
minutes at 14,000g in order to visualize pronuclei in 

15 one-cell and nuclei in two-cell ova. Ova may then be 
transferred to a 5-10/il drop of HEPES medium under oil 
on a depression slide. Microinjection may be 
performed using a Labor lux microscope with Nomarski 
optics and two Leitz micromanipulators. 10-1700 

2 0 molecules of construct DNA (linearized at a 

concentration of about lng/jxl of Tris-EDTA buffer) may 
be injected into one pronucleus in one-cell ova or 
both nuclei in two-cell ova. Microinjected ova may be 
returned to microdrops of BMOC-3 medium under oil and 
25 maintained at 38 °C under a 90% N 2 , 5% C0 2 , 5% 0 2 
atmosphere prior to their transfer to suitable 
recipients. Ova may preferably be transferred within 
10 hours of recovery. Only recipients which exhibit 
estrus on the same day or 24 hours later than the 

3 0 donors may preferably be utilized for embryo transfer. 

Recipients may be anesthetized as described supra . 
Following exteriorization of one oviduct, at least 3 0 
injected one- and/or two-cell ova and 4-6 control ova 
may be transferred in the following manner. The 
35 tubing from a 21g x 3/4 butterfly infusion set may be 
connected to a lcc syringe. The ova and one to two 
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mis of BMOC-3 medium may be aspirated into the tubing. 
The tubing may then be fed through the ostium of the 
oviduct until the tip reaches the lower third or 
5 isthmus of the oviduct. The ova may be subsequently 
expelled as the tubing is slowly withdrawn. The 
exposed portion of the reproductive tract may be 
bathed in a sterile 10% glycerol - 0.9% saline 
solution and returned to the body cavity. The 

10 connective tissue encompassing the linea alba, the 
fat, and the skin may be sutured as three separate 
layers. An uninterrupted Halstead stitch may be used 
to close the linea alba. The fat and skin may be 
closed using a simple continuous and mattress stitch, 

15 respectively. A topical antibacterial agent (e.g. 

Furazolidone) may then be administered to the incision 
area. Recipients may be penned in groups of about 
four and fed 1.8 kg of a standard 16% crude protein 
corn-soybean pelleted ration. Beginning on day 18 

20 ( da Y 0 = onset of estrus) , all recipients may be 

checked daily for signs of estrus using a mature boar. 
On day 35, pregnancy detection may be performed using 
ultrasound. On day 107 of gestation recipients may be 
transferred to the farrowing suite. In order to 

25 ensure attendance at farrowing time, farrowing may be 
induced by the administration of prostaglandin (10 
mg/ injection) at 0800 and 1400 hours on day 112 of 
gestation. In all cases, recipients may be expected 
to farrow with 34 hours following PGF 2a 

3 o administration . 

As used herein, the term "transgenic animal" 
refers to animals that carry a transgene in at least 
some of their somatic cells, and preferably in at 
least some of their germ cells. 
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5,3.3. INDUCTION 
Induction of expression of the gene of interest 
in transgenic animals of the invention may be achieved 
5 by administering, to the animal , a compound that binds 
to tetR so that tetR repressor function is inhibited. 
Examples of such compounds include tetracycline and 
tetracycline-like compounds, including, but not 
limited to, apicycline, chlortetracycline, 

10 clomocycline, demeclocyline, guamecycline, lymecycline, 
meclocycline , methacycline , minocycline , 
oxytetracycline , penimepicycline , pipacycline , 
rolitetracycline, sancycline, and senociclin. 

Administration of the inducer can be through 

15 direct injection, water, feed, aerosol, or topical 

application. The choice of method will depend on the 
promoters used and the specific application of the 
transgenic animals. For example, injection, water and 
feed would provide inducer to all of the animals 

20 tissues. In our case, administration through water or 
feed would be the preferred method to control growth 
hormone expression in transgenic pigs. Aerosol spray 
could be used to attain high antibiotic concentrations 
in the lung. This may be appropriate for example in a 

25 cystic fibrosis or emphysema model. Topical 

application to the skin is also possible and could be 
used in models of acne, hair loss, wound healing or 
viral infection. 

Induction of the gene of interest is accomplished 

30 by administering an effective amount of inducer, as 
described above. An effective amount of inducer may 
be construed to mean that amount which increases 
expression of the gene of interest by at least about 
50 percent. As the LD 50 for tetracycline HC1 in rats 

35 is about 6643 mg/kg and the therapeutic dose is 
between about 25-50 mg/kg, an effective dose of 
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tetracycline, as inducer, is between about 5-50 mg/kg 
and preferably between about 5-15 mg/kg. 

5 6. EXAMPLE: TETRACYCLINE REPRESSOR-MEDIATED 
BINARY REGULATION SYSTEM FOR CONTROL OF 
BOVINE GROWTH HORMONE EXPRESSION IN 
TRANSGENIC M?CE 

6.1. MATERIALS AND METHODS 

6.1.1. CONSTRUCTION OF PLASMIDS 

X0 Plasmid pddl contains a functional tetR operator 

site cloned within a Bluescript (Stratagene) 
poly linker. This plasmid is useful for propagating 
the operator sequence, and as a source of operator 
sites for insertion into the PEPCK promoter or any 

15 other promoter element. The pddl plasmid was made as 
follows. Four oligonucleotides, which when annealed 
produce the two 19bp OP1 and OP2 palindromic sequences 
of the tetR operator were synthesized. The sequences 
of each oligonucleotide is listed below. 

2 0 X-l . 5 • ACTCTATCATTGATAGAGT 3 1 

X-2.5 1 ACTCTATCAATGATAGAGT 3 1 

X-3.5' TCCCTATCAGTGATAGAGA 3' 

X-4 . 5 1 TCTCTATCACTGATAGGGA 3 1 
Oligonucleotides X-l and X-2 are complementary and 
25 when annealed form the OPl operator. Similarly 

oligonucleotides X-3 and X-4 produce the OP2 operator 
site. The OPl oligonucleotides were directly cloned 
into the EcoRV site of the Bluescript polylinker. The 
resulting plasmid pSOPI was sequenced to verify the 

3 0 integrity of the insert. 0P2 oligonucleotides were 

subsequently cloned into a Mung bean nuclease blunted 
Clal site of pSOPI to produce pdd7. Due to a cloning 
artifact produced by the Mung bean nuclease, the 
operator in pddl consists of the two 19bp OPl and OP2 
35 sequences separated by linker of only 10 base pairs. 
This difference does not effect tetR binding. The 
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sequence of the pddl operator site is shown in Figure 
IB. The 55 base pair tetR operator was excised from 
pddl as an EcoRl, AccI fragment, end filled with T4 
5 polymerase, and gel purified. This fragment was 
subsequently used to produce the modified PEPCK 
promoters Pck_N and Pck_A. 

Plasmids Pck_A and Pck_N were produced by 
inserting the 55bp tetR operator into the unique AccI 

10 and Nhel sites (respectively) of the PEPCK promoter 
(pPCK_NbGH has been deposited with ATTC and assigned 
accession No: ) . For both plasmids the promoter was 
cut with the appropriate restriction enzyme, end 
filled with T4 polymerase and the tetR operator blunt 

15 ligated into place. A third modified PEPCK promoter, 
PckJT was produced in which the OP1 and OP2 operator 
sequences were positioned to flank the PEPCK TATA-box 
element. To produce PckJT a new oligonucleotide 
(5 1 ACTCTATCATTGATAGAGTTACTAT 

20 TTAAATCCCTATCAGTGATAGAGA3 • ) was produced. This 

oligonucleotide was kinased with T4 polynucleotide 
kinase and annealed to kinased X-2 and X-4 which are 
complementary to the first and last 19bp. The 
complete double stranded 49bp operator was produced by 

25 filling in the llbp linker region, which includes the 
PEPCK TATA- box element, with Klenow. The final 
product was then blunt cloned into an AccI, Nhel cut 
PEPCK promoter. All three modified promoters were 
sequenced to verify the inserts. Figure 2 depicts the 

30 structure of these promoters. 

6.1.2. REPRESSOR CONSTRUCT 
Plasmid pBI501 contains a 701 bp Hindi fragment 
from E. coli TnlO, cloned into the Hindi site of 
35 pUC8. The Hindi insert contains the entire tetR 

coding sequence along with 21bp of 5 1 and 55bp of 3 1 
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untranslated DNA. This insert was excised from the 
parent plasmid and subcloned into a plasmid with a 
more suitable polylinker to produce pSTET7. To this 
5 plasmid a 870bp Xhol, BamHI fragment derived from pMSG 
(Pharmacia) , containing the SV40 small-T intron and 
polyadenylation signal sequences was inserted at the 
Hindu site 3* of the tetR coding region to produce 
pSTetRSv. Finally an unmodified 610bp PEPCK promoter 

10 was inserted at the EcoRl site of pSTETRSv to produce 
pPckjtetRSv. The PEPCK promoter is identical to the 
promoter used to produce pPck_A, pPck_N, and pPck_T 
except that it does not contain a tetR operator site. 
This PEPCK promoter has been previously used in 

15 transgenic animals and is known to target gene 
expression specifically to the liver. 

6.1.3. GROWTH HORMONE GENES 
Plasmid pGH-SAF107 contains a 2.2kb BamHI, EcoRI 
genomic fragment of the bovine growth hormone (bGH) 
gene, blunt ligated into an EcoRV site. To this 
vector each of the modified PEPCK promoters was added 
by blunt ligating the promoter into the BamHI site of 
pGH-SAF107. The structure of the resulting plasmids 
is depicted in Figure 3. Plasmid pPCK_NbGH was 
deposited with the ATCC and assigned accession number 

. For production of transgenic animals > 

each of the PEPCK-bGH genes was excised from the 
vector using Xhol and Sacl, gel fractionated and 
purified using an Elutip column. 

6.1.4. TRANSGENIC MICE 
Transgenic mice were made which contain both the 
Pck_tetRSv gene and one of the modified PEPCK 
promoters controlling bGH. Table 2 lists the number 



25 
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of eggs injected, offspring produced and number of 
transgenics derived" for each construct. 



TABLE 2 





Construct 


Eggs 

injected 


Eggs 

transferred 


Live 
Born 


Transgenic 


Pck AbGH + Pck tetRSv 
" (251) 

Pck-NbGH + Pck tetRSv 
(252) 

Pck TbGH + Pck-tetRSv 
" (261> 


233 
268 
227 


194 
208 
197 


40 

30 
25 


14 (0.35) 

9(0.3) 

5(0.2) 


6.2. 


PFISTJTVTS AND 


HTSCUSSION 







15 



once the transgenic founder animals were 
identified, they were weighed each week. Table 3 
lists the mean weights of each group of transgenic 
animal at 11 weeks of age. 



TABLE 3 





Construct "H 


Sex 


1 


weight 


Pck_ 
Pck_ 


AbGH + Pck_tetRSv(9) 
AbGH + Pck_tetRSv(4) 


male 
female 


36. 
29. 


122(12.251) 
125(7.861) 


Pck_ 
Pck_ 


NbGH + Pck_tetRSv(5) 
NbGH + Pck_tetRSv(4) 


male 
female 


34. 
28. 


840(14 .745) 
125(10.958) 


Pck_ 
Pck_ 


TbGH + Pck_tetRSv(3) 
TbGH + Pck_tetRSv(2) 


male 
female 


36. 
27. 


267 (11.402) 
300(5.798) 


NON 
NON 


-TRANSGENIC (6) 
-TRANSGENIC (6) 


male 
female 


29. 
23 


.583(2.395) 
.117(1.863) 



As expected for each co-injection, large animals, 
3S obviously expressing elevated levels of bGH, were 
observed as were animals of normal stature. 
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At 10 weeks of age, a sampling of transgenic 
female founders containing the A+T and N+T were tested 
for induction of bGH in the serum using a radio-immune 
5 assay, after a single IP injection of 60 mg/kg 

tetracycline-HCl. The purpose of this experiment was 
simply to determine which if either of these two 
modified promoters was responsive to repression by 



tetR. The 


results are 


summarized in Table 


4. 








TABLE 4 








Construct 


Animal 


Weight 


Basal 


12 hours 


36 hours 


249 


2-5 female 


21.1 


0.00 


0.00 


0.00 


250 


6-6 female 


42.9 


4.6+0.033 


3.4+0.062 


4.9+0.072 


251 


6-6 female 


19.3 


0.00 


0.00 


0.00 


251 


10-5 female 


25.1 


0.20+0.008 


0.19+0.001 


0.21+0.038 


252 


5-2 female 


38.7 


0.59+0.107 


0.64+0.044 


1.12+0.207 


252 


5-3 female 


20.0 


0.00 


0.00 


0.00 


252 


10-2 


19.2 


0.00 


0.00 


0.00 



No induction of bGH was observed in animals that lack 
the Pck_tetRSV gene (construct 250) or in animals with 

25 both the Pck_AbGH + Pck-tetRSv genes (construct 251) . 
An approximate two fold increase in serum bGH levels 
was however detected in the 5-2 female which contains 
the Pck-NbGH + Pck_tetRSV genes. The remainder of the 
animals had undetectable levels of bGH expression , due 

30 in part to the relatively low sensitivity of this 
assay. For example the 10-2 female (construct 252) 
shows no detectable bGH in the serum, but subsequent 
experiments on her offspring indicate that this line 
of animals does express bGH mRNA in a tetracycline 

35 dependent manner. This initial data, suggested that 
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the Pck_N promoter was being regulated by tetR at 
least to a limited extent. 

To further characterize the mice, improve the 
5 sensitivity of the assay and to test the 

responsiveness of the Pck_T promoter, offspring of 
founder mice from each co-injection were produced. 
The transgenic progeny were then raised in the 
presence or absence of tetracycline medicated water 

10 (500/ig/ml) for 4 weeks, prior to analysis of bGH mRNA 
expression levels in the liver, the predominant site 
of PEPCK expression. Northern blot hybridization 
analysis of these animals (Figure 7) demonstrated 
again, that animals with the Pck_NbGH gene were 

i5 responsive to repression by tetR and that the other 
two modified promoters exhibited no signs of tetR 
dependent regulation. 

We attempted to breed all of the remaining 
founders containing the Pck-NgGH + Pck_tetRSv genes to 

20 analyze their offspring in a similar manner (Figure 

8) . Of the 5 founders which produced offspring, 2 did 
not express bGH under any conditions, and from the 
remaining 3 one segregated two different integration 
sites allowing us to establish a total of 4 lines. 

25 All 4 lines exhibited tetracycline dependent bGH 

expression as assayed by Northern blot hybridization. 
The efficiency of tetR repression appeared to be 
inversely correlated with the level of expression. 
For example 9-5 animals have the highest level of bGH 

3 0 expression, show an obvious increase in body size, and 
exhibit only marginal tetR repression. In contrast 9- 
4Lc and 10-2 animals exhibit lower levels of 
tetracycline induced bGH expression, are of normal 
stature and appear to be efficiently regulated by 

35 tetR. 
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An SI nuclease protection assay was performed to 
identify the start site of transcription of bGH mRNA. 
As shown in Figure 4, there was only one start site 
5 identified regardless of the presence or absence of 
tetR repressor binding. This start site was located 
approximately 20 bp downstream from the TATA-box. At 
this location, the message is initiating within the 
537 operator sequence, just 3 or 4 base pairs 5 1 of 
10 the first tetR binding site. 

7. EXAMPLE: OPTIMIZATION OF tetR CODING SEQUENCE 
The use of the wild type TnlO tetR gene in 
conjunction with the 252 construct indicates that the 

15 TetR system can function in transgenic animals and 
that in some cases, for instance in the 10-2 
transgenic animals , the level of regulation can be 
very high (FIGS. 9A and 9B) . However, in other 
instances the efficiency of repression is not always 

20 complete, leading to a significant basal level of bGH 
expression. This failure to repress may be due to low 
level expression of tetR. To optimize the expression 
of tetR repressor, a synthetic tetR gene was generated 
which was devoid of splice signals and had optimized 

25 codon usage for mammalian cells. 



7.1 MATERIALS AND METHODS 

7.1.1. TISSUE SPECIFICITY AND TETRACYCLINE 
INTRODUCTION OF bGH IN LINE 10-2 

30 

For all Northern blots lOjig of whole RNA was 
electrophoreses through a 1% agarose gel containing 3% 
formaldehyde using standard techniques. To detect bGH 
mRNA a random primed, radioactive bGH cDNA probe was 
35 used. All conditions for hybridization and washing of 
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filters were done in accordance with standard 
techniques of molecular biology. 

5 7.1.2. EXPRESSION AND ALTERNATIVE PROCESSING 

OF THE tetR TRANSGENE 

A RNase protection probe which extended from the 
Nrul site of tetR 3 % to the end of the gene was used. 
This probe includes only tetR coding sequences and 

10 should give a fully protected fragment of 

approximately 400 base pair. When hybridized to 150/xg 
of liver RNA (500,000 cpm of probe in a 30^1 
hybridization consisting of 80% formamide, 4 0mM PIPES 
pH 6.4, 400mM NaOAc , and ImM EDTA) , and digested with 

15 RNase one (Promega) for 30 minutes at 37° as 

recommended by the manufacturer, a protected fragment 
of approximately 221-260 base pairs is observed, far 
smaller than predicted. 

20 7.1.3. 5' STRUCTURE OF tetR mRNA 

Liver RNA was treated with reverse transcriptase 
and amplified by PCR using the manufacturers 
recommended conditions (Pharmacia) . The RNA was 
amplified using two different pairs of primers. The 

25 first primer pair (TZ-1 and TZ-4) should produce a 619 
base pair product. The second primer pair (TZ-3 and 
TZ-4) should produce a 498 base pair product. The 
sequence of the primers are: 
TZ-1 : 5 1 CCGCATATGATCAATTCAAGGCCGAATAAG3 ■ 

30 TZ-3 : 5 1 CTTTAG CG ACTTG ATG CTCTTG ATCTTC CA 3 ■ 
TZ-4 : 5 1 AATTCGCCAGCCATGCCAAAAAAGAAGAGG3 1 

The TZ-4 primer is common to both primer pairs 
and is the 5' primer which encompasses the start codon 
of the tetR mRNA. Primer TZ-1 and TZ-3 are two 

35 different 3 1 primers both of which are in the tetR 
coding region. When amplified, these primer pairs 
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produce smaller than expected products (approx. 215bp 
vs. 619bp for TZ-4 and TZ-1, and approx, 94bp vs. 
498bp for TZ-4 and TZ-3). The products of this 
5 reaction were cloned and sequenced. The sequence 
revealed the presence of an unexpected intron which 
spanned from near the Xbal site at the start of tetR 
to a splice acceptor just 8 base pairs 5' of the TZ-3 
primer. 

10 

7.1.4. 345 REPRESSOR CONSTRUCT 

In an embodiment of the invention, any nuclear 
localization signal may be added to a natural or 
synthetic tetR gene to facilitate its expression. For 

15 example, complementary oligonucleotides which encode a 
nuclear localization signal sequence were synthesized 
(Oligos etc.) and added in frame to the tetR coding 
sequences of pSTETR107 at the EcoRl and Xbal 
restriction sites to produce pNTETR. Oligonucleotide 

20 sequences are: 

(GB1) 5 1 AATTCGCCAGCCATGCCAAAAAAGAAGAGGAAGGTAT3 1 and 
(GB2 ) 5 1 CTAGATACCTTCCTCTTCTTTTTTGGCATGGCTGGC3 1 . 
When annealed these oligonucleotides have a 5 1 EcoRl 
and 3' Xbal compatible overhangs. These 

25 oligonucleotides fuse the amino acid sequence Met Pro 
Lys Lys Lys Arg, Lys Val,to the third amino acid (Arg) 
of wild type tetR. 

Two complementary 51 base pair oligonucleotides 
which start the 5' cap site of bGH and extend to the 

30 first exon were synthesized (Oligos etc.). Sequence 
for the oligonucleotides are (5b-l) : 

5 • GATCCCAGGACCCAGTTCACCAGACGACTCAGGGTCCTGTGGACAGCT 
CAG3 1 

and (5b-2): 

35 5 1 AATTCTGAGCTGTCCACAGGACCCTGAGTCGTCTGGTGAACTGGGTCC 
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TGG3 ' • When annealed these oligonucleotides have 5' 
BamHl and 3 1 EcoRl compatible overhands. The 
oligonucleotides for the 5' leader sequence of bGH 
5 were cloned into a BamHl, EcoRl cut plasmid to produce 
p5'GH. 

The nuclear localization modified tetR coding 
sequence was isolated by gel purification after 
restriction digestion of pNTETR using EcoRl and Hind 

10 III. This fragment was then inserted into p5'GH at 
the EcoRl and Hind III sites to product p5 1 GHTR. 

To add the remainder of the bGH genomic sequence 
an intermediate modification of p5 1 GHTR was first 
made. This modification consisted of adding a 

15 Hind III - Pstl linker to the Hind III site of p5 1 GHTR 
to product pGTO. The sequence of the oligonucleotides 
which comprise this linker are: (CC-1) 
5 • AGCTTCTGCAG3 ■ and ( CC-2 ) 5 ■ AGCTCTGCAGA3 • . The 
remaining bGH genomic sequences were added in two 

2o steps. First the Pstl Sac2 fragment that begins in 

the first exon of bGH and ends in the third intron was 
excised from pSGH107. Similarly, the insert of pGTO 
which contains the 5 1 untranslated leader of bGH and 
the nuclear localization modified tetR was excised 

25 using BamHl and Pstl. These two gel purified 

fragments was then cloned into a BamHl Sac2 cut vector 
to produce pGTG. Finally, the remainder of the bGH 
gene from the Sac2 site in the third intron to the end 
of the gene, was added to pGTG by cutting pGTG with 

30 Sac2 and adding the Sac2 fragment from pSGH106 to 
produce pNTETR-GH. 

Plasmid pNTETR-GH was digested with BamHl to 
excise the NTETR-GH gene. The fragment was cloned 
into the BamHl site of pPCK 305 to produce the final 

35 plasmid pPCK-GHNTET. To produce transgenic mice, the 
PEPCK-GHTET gene was excised from the plasmid using 
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Sail and Sacl. This fragment was gel purified and 
coinjected with the PCK-NbGH gene previously described 
to generate transgenic mice. 

5 

7.1.5. SYNTHETIC tetR COMPONENT SEQUENCES 
The components of the synthetic tetR gene were 
synthesized by Midland Laboratories as four 
overlapping double stranded DNA cassettes. The 

10 sequence of these cassettes are shown in Figure 15. 

Each cassette was blunt cloned into the Hinc2 site of 
pUC19 and sequenced to verify authenticity. The 
resulting plasmids pLTl, pLT2, pLT3 and pLT5 can be 
used as the source material to assemble the entire 

15 synthetic tetR coding sequence since each contains an 
overlapping unique restriction site (bold face) 
through which they can be joined (pLT-1, pLT-2, pLT-3 
and pLT-5 have been deposited with ATCC and have been 
assigned accession numbers , , , and 

2 0 respectively) . There are many possible ways by which 

these cassettes can be joined. By way of an example, 
the inserts of plasmid pLTl and pLT2 can be excised 
using EcoRl and Nsil. The inserts can then be 
combined by cloning these two fragments into an EcoRl 

25 vector. This procedure will assemble the 5' half of 
the gene, using the overlapping Nsil restriction site 
to join the pieces. Similarly, the 3' half of the 
gene can be assembled from pLT3 and pLT5 by cutting 
with EcoRl and Sphl (pLT3) and Sphl and Hind III 

30 (pLT5) to release the inserts. These inserts can then 
be joined at the overlapping Sphl site by cloning the 
fragments into an EcoRl, Hind III cut vector. 
Finally, the entire coding region can be put together 
using the overlapping restriction site ApaLl . This 

3 5 would result in a vector with the synthetic tetR 
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coding sequence, as depicted in Figure 16 , cloned into 
a plasmid as an EooRl Hind III fragment. 

5 7.1.6. COMPOSITIONAL ANALYSIS OF 

WILD TYPE TnlO tetR GENE 

The TnlO tetR coding sequence was analyzed on a 
desktop computer using Mac Vector software. Figure 14 
shows a diagram of the tetR coding region with all of 

10 the plus strand splice doner (D) and splice acceptor 
(A) signal sequences indicated. For reference the 
location of the Xbal restriction is also indicated. 
The first graph depicts the percentage of G and C 
bases in the coding region of tetR. There are several 

15 domains of very low GC content. The bottom graph is 
an analysis of codon bias. The dark line is a 
comparison of the tetR codon usage to a mouse codon 
bias table. Values much lower than 1.0 are indicative 
of sequences which may translate poorly. For 

20 reference, a comparison of tetR to a Tobacco codon 
bias table is included (light line). In transgenic 
tobacco, the tetR regulation system functions very 
efficiently, suggesting that for this gene, codon bias 
may be an important factor for efficient expression. 

25 

7.1.7. COMPOSITIONAL ANALYSIS OF SYNTHETIC tetR 
Figure 17 depicts the structure of the synthetic 
tetR gene, now devoid of splice donor signal 
sequences, with only a single splice acceptor signal 

30 remaining (A) . This is not the splice acceptor which 
was active in the 345 construct. The percentage of G 
and C bases has been significantly improved, while the 
frequency of CpG base pairs has been kept to a 
minimum. A CpG base pair is frequently the site for 

35 DNA methylation, which can negatively effect the 

expression of a gene. The codon bias of the synthetic 
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tetR gene is vastly improved. The graph depicts the 
results when the synthetic tetR coding sequence is 
compared to the same mouse codon bias table used 
5 previously. 

7.2 RESULTS 

7.2.1. EXPRESSION OF tetR IN CONSTRUCT 345 OFFSPRING 
To improve tetR expression a new repressor 

10 construct was produced. The construct, referred to as 
Construct 345 is depicted in Figure 10. In the 345 
construct the coding region of tetR is augmented with 
a nuclear localization signal sequence to increase the 
nuclear concentration of repressor. The tetR coding 

15 region was inserted into the first exon of the bGH 
gene. The bGH gene then acts as a genomic carrier, 
providing multiple introns, which may improve 
expression, and a strong polyadenylation signal, which 
may improve the processing and stability of the 

20 message. 

The new repressor was coinjected with the bGH 
gene from construct 252. The resulting transgenic 
animals contain the new repressor, and a PEPCK 
regulated bGH gene with the tetR operators located 

25 just 3 1 of the PEPCK TATA •box element. Offspring of 
these animals were screened for bGH induction (FIG. 
11). Of the lines tested only one, line 14 , showed 
tetracycline dependent regulation of bGH, and in this 
one case there was still a significant base level of 

3 0 bGH expression. Northern analysis, performed to 
determine the levels of tetR mRNA expressed in the 
transgenic mice, indicated that the tetR gene was 
still not expressed at a high level. 

To detect tetR mRNA with higher sensitivity the 

35 tetR mRNA was analyzed using RNase protection. This 
technique revealed that the mRNA was shorter then 
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expected (FIG. 12) . Subsequent analysis using reverse 
transcriptase-PCR with primers that amplify the entire 
coding region of tetR confirmed that the mRNA was 
5 significantly shorter then expected (FIG. 13). 

Sequence analysis of these RT-PCR products indicated 
that an unexpected splicing event had occurred. This 
splicing process occurred between a splice donor 
signal in the 5' end of the tetR coding region and a 
10 splice acceptor approximately 400 bp 3 1 of the start 
codon. The resulting mRNA is therefore deleted of the 
tetR DNA binding domain and about two third of the 
entire coding region. This mRNA could not possibly 
make a functional repressor. 

15 

7.2.2. OPTIMIZATION OF tetR CONSTRUCT 
A more detailed analysis of the tetR coding 
sequence indicated that the codons used in this gene 
are poorly suited for expression in mammalian cells 

2o (FIG. 14). Therefore, it appears that the 

inefficiency of the tetR system is the result of two 
processes: (i) aberrant splicing of the RNA to 
produce a nonfunctional message; and (ii) inefficient 
translation which can lead to rapid mRNA turnover. 

25 To circumvent the problems of internal splicing 

and potential problems due to codon bias and G-C 
content, a synthetic tetR gene was designed. The 
components of the synthetic tetR gene were synthesized 
as four overlapping double stranded cassettes. Each 

30 cassette was cloned in pucl9. The resulting plasmids 
designated pLT-1, pLT-2 , pLT-3 and pLT-5, as depicted 
in Figure 15, have been deposited with ATCC and 

assigned accession numbers , , , and 

, respectively. The synthetic tetR (syn-tetR) 

35 has been designed to encode exactly the same protein 
product, but is devoid of splice signals and has 
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greatly improved codon usage for mammalian cells. The 
sequence of the of the syn-tetR is indicated in Figure 
16. The predicted analysis for splicing signals, G+C 
5 content, and codon usage are depicted in Figure 17. 

8. DEPOSIT OF MICROORGANISMS 
The following microorganisms have been deposited 
with the American Type Culture Col lection , (ATCC) , 
10 Rockville, Maryland and have been assigned the 
following accession numbers: 

Microorganism Date of Deposit Accession No. 

pLT-1 August 25, 1993 

pLT-2 August 25, 199 3 

15 pLT-3 August 25, 1993 

pLT-5 August 25, 1993 

pPCK_NbGH August 25, 199 3 

The present invention is not to be limited in 
scope by the microorganisms deposited since the 
2 0 deposited embodiments are intended as illustrations of 
single aspects of the invention and any microorganisms 
which are functionally equivalent are within the scope 
of the invention. 

The present invention is not: to be limited in 
25 scope by the exemplified embodiments which are 

intended as illustrations of single aspects of the 
invention, and any clones, DNA or amino acid sequences 
which are functionally equivalent are within the scope 
of the invention. Indeed, various modifications of 
3q the invention in addition to those described herein 
will become apparent to those skilled in the art from 
the foregoing description and accompanying drawings. 
Such modifications are intended to fall within the 
scope of the appended claims. 



35 
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It is also to be understood that all base pair 
sizes given for nucleotides are approximate and are 
used for purposes of description, 
5 Various publications are cited herein, which are 

hereby incorporated by reference in their entirety. 



10 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Byrne, Guerard 

(ii) TITLE OF INVENTION: TETRACYCLINE REPRESSOR-MEDIATED BINARY 
REGULATION SYSTEM FOR CONTROL OF GENE EXPRESSION IN 
TRANSGENIC ANIMALS 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/935,763 

(B) FILING DATE: 26-AUG-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A. 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE /DOCKET NUMBER: 6794-025 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212 790-9090 

(B) TELEFAX: 212 869-8864/9741 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TTGACACTCT ATCATTGATA GAGTTATTTT ACCACTCCCT ATCAGTGATA GAGAAAAGT 
(2) INFORMATION FOR SEQ ID NO: 2: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GAATTCGATA CTCTATCATT GATAGAGTAT CAAGCTTATC CCTATCAGTG AT AG AG AT AC 60 
CGTCGACCTC 70 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACTCTATCAT TGATAGAGTT ACTATTTAAA TCCCTATCAG TGATAGAGA 49 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GG AATT COAT ACTCTATCAT TGATAGAGTA TCAAGCTTAT CCCTATCAGT GAT AG AG ATA 60 
CCGTCGACCT C 71 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TCT AG A TTA GAT AAA AG? AAA GTG ATT AAC AGC GCA TTA GAG CTG 4S 
Met Ser Arg Leu Asp Lvs Ser Lvs Val lie Asn S „-r Ala Leu Glu Leu 
1 5 10 15 

CTT AAT GAG CTC GGA ATC GrJ-. CGT TTA ACA ACC CGT AAA CTC GCC CAG 9r 
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Leu Aen Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 2 5 30 

AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA AAA AAT AAG 144 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp Hi Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 
50 55 60 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 240 
Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

AAT AAC GCT AAA AGT TTT AG A TGT GCT TTA CTA AGT CAT CGC GAT GGA 288 
Asn Asn Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 384 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GOG CAT TTT ACT TTA GGT TGC 432 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 480 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 * 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 57 6 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCT TAA 624 
lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Ser Arg Leu Asp Lvs Ser Lvs Val He Asn Ser Ala Leu Glu Leu 
15 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr A~c Lys Leu Ala Gin 
20 * 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 
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Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 

50 55 60 

Thr His Phe Cys Pro Leu Glu Gly <Jlu Ser Trp Gin Asp Phe Leu Arg 

65 70 75 80 

Asn Asn Ala Lys Ser Phe Ara Cys Ala Leu Leu Ser His Arg Asp Gly 

85 " 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 

100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 

115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 

130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 

145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 

165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 

180 185 190 

lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 

195 200 205 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 92 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGGCCCTATA AAAAGCGAAG CGCGCGGCGG GCGGGAGTCG CTGCGTTGCC TTCGCCCCGT 60 
GCCCCGCTCC GCGCCGCCTC GCGCCGCCCG CC 92 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

{ ii ) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAGAAG7A7A TTAGAGCGAG TCTTTCTGCA CACACGATCA COTTTCCTAT CAACCCCACT 6C 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTATTATGTT TTATGTTACT GTAAAAGATG TAAAGAGAGG CACGTGGTTA AGCTCTCGGG 60 
GTGTGGACTC CACC 74 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGCCCCAAGC ATAAACCCTG GCGCGCTCGC GGCCCGGCAC TCTTCTGGTC CCCACAGACT 60 
CAGAGAGAAC CCA 73 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TAGGCAGCAG G C AT ATGGG A TGGGATATAA AGGGGCTGGA GCACTGAGAG CTGTCAGAGA 
TTTCTCCAAC CCAG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(h) LENGTH: 19 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
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ACTCTATCAT TGATAGAGT 19 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 baBe pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



<xi) SEQUENCE DESCRIPTION * SEQ ID NO:13: 
ACTCTATCAA TGATAGAGT 19 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 14: 
TCCCTATCAG TGATAGAGA 19 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCTCTATCAC TGATAGGGA 
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Internaiional Application No: PCT/ / 



MICROORGANISMS 

Optional Sheet in connection with the microorganism referred to on paae 38, linec 7-23 of the description ' 



A. IDENTIFICATION OF DEPOSIT ' 

Further deposits are identified on an additional sheet 



Name of depositary institution ' 
American Type Culture Collection 



Address of depositary institution {including postal code and country) ' 

12301 Perklawn Drive 
Rockville, MD 10582 
US 



Date of deposit 1 August 25, 1993 Accession Number * N/A 



B. ADDITIONAL INDICATIONS ' (ks*» blank if a* «ppik»bfe). Thi. tsfenmiicn m c 



C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ' 



D. SEPARATE FURNISHING OF INDICATIONS ' 



The indications teted below will be submitted to the tntsr national Bureau later * (Specify the osnsrsl neturs of the indications 
'Ac e ■■■i on Nimtsf of DoooorT) 



E. S) This sheet was received with the International application when Hied (to be checked by the receiving Office) 



// ^ ft i) 

(Authotfured Officer) A 



G The date of receipt (from the applicant) by the International Bureau ' 



(Authorized Officer) 

Form PCT/RO/134 (January 1981) 
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International Application No: PCT/ / 

Form PCT/RO/134 (com.) 

American Type Culture Collection 

12301 Parklawn Drive 
Rockville, MD 10582 
US 

Date of Deposit 
August 25, 1993 
August 25, 1993 
August 25, 1993 
August 25, 1993 



Accession No. 
N/A 
N/A 
N/A 
N/A 
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WHAT IS CLAIMED IS: 

1. A substantially purified and isolated 
nucleic acid molecule comprising an animal promoter 

5 element that comprises a tetR operator sequence. 

2. The nucleic acid molecule of claim 1 in 
which the tetR operator sequence is positioned 3 9 to a 
TATA-box sequence. 

10 

3 . The nucleic acid molecule of claim 1 in 
which the promoter element is the PEPCK promoter. 



4 . The nucleic acid molecule of claim 3 in 
15 which the tetR operator sequence has been inserted 

into the Nhel site of the PEPCK promoter element. 

5. The nucleic acid molecule of claim 1, 2, 3 
or 4 in which the promoter element controls the 

2 0 expression of a gene of interest. 

6. The nucleic acid molecule of claim 5 in 
which the gene of interest is bovine growth hormone. 

7. A non-human transgenic animal that carries, 
a transgene, the nucleic acid molecule of claim 1, 
3 or 4. 

8. A non-human transgenic animal that carries, 
as a transgene, the nucleic acid molecule of claim 5. 

9. A non-human transgenic animal that carries, 
as a transgene, the nucleic acid molecule of claim 6. 



25 

as 
2, 



10. The non-human transgenic animal of claim 7 
that further carries a transgene encoding the tetR 
repressor protein. 



11. The non-human transgenic animal of claim 8 
that further carries a transgene encoding the tetR 
repressor protein. 

12. The non-human transgenic animal of claim 9 
that further carries a transgene encoding the tetR 
repressor protein. 

13 . A non-human transgenic animal that carries a 
transgene encoding the tetR repressor protein. 

14 . A method of selectively inducing the 
expression of a gene of interest in a non-human 
transgenic animal comprising administering a 
tetracycline compound to a non-human transgenic animal 
that carries a first transgene which is a gene of 
interest under the control of a promoter element 
modified to comprise a tetR operator seguence and a 
second transgene encoding the tetR repressor protein. 

15. A non-human transgenic animal that carries 
(i) a first transgene that encodes bovine growth 
hormone and is under the control of PEPCK promoter 
element modified to contain a tetR operator at the 
Nhel site; and (ii) a second transgene that encodes 
tetR repressor protein. 

16. The transgenic animal of claim 15 that is a 
mouse . 
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17. The transgenic animal of claim 15 that is a 

pig. 

5 18. A substantially purified and isolated 

nucleic acid molecule comprising an optimized tetR 
gene as depicted in Figure 16. 

19. The non-human transgenic animal of claim 7 
10 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

20. The non-human transgenic animal of claim 8 
15 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

21. The non-human transgenic animal of claim 9 
20 that further carries an optimized transgene encoding 

the tetR repressor protein and having a sequence as 
depicted in Figure 16. 

22. A non-human transgenic. animal that carries 
25 an optimized transgene encoding the tetR repressor 

protein and having a sequence as depicted in Figure 
16. 

23. A method of selectively inducing the 
30 expression of a gene of interest in a non-human 

transgenic animal comprising administering a 
tetracycline compound to a non-human transgenic animal 
that carries a first transgene which is a gene of 
interest under the control of a promoter element 
35 modified to comprise a tetR operator sequence and a 

second optimized transgene encoding the tetR repressor 
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protein and having a sequence as depicted in Figure 
16. 

24. A non-human transgenic animal that carries 
(i) a first transgene that encodes bovine growth 
hormone and is under the control of PEPCK promoter 
element modified to contain a tetR operator at the 
Nhel site; and (ii) a second optimized transgene that 
encodes tetR repressor protein that has a sequence as 
depicted in Figure 16. 



10 



15 



25. The transgenic animal of claim 24 that is a 
mouse. 

26. The transgenic animal of claim 24 that is a 

pig. 
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EcoRl OP1 linker 

ggaattcgat-ACT CTA TCA TTG ATA GAG TATCAAGCTTAT CCC 

OP2 AccI 
TAT CAG TGA TAG AGA-taccgtcgacctc 
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10 20 30 40 

» * « « 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG 
MSRLDKSKVINSALE> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); CO0ON_START=1 > 

b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

< o_1520__o o_903 TO 1526 OF TRN10TETR_o_1490_o o 

50 60 70 80 90 



CTG CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC 
LLNEVG IEGLTTRKL> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C000N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

<1480_o o 1470.903 TO 1526 OF TRN10TETR_0_o o I440o 

100 110 120 130 

« * * » 

GCC CAG AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA 
AOKLGVEQPTLYWHV> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

< o_1430_o o_903 TO 1526 OF TRN10TETR_o_1400_o o 

140 150 160 170 180 



AAA AAT AAG CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA 
KNKRALLDAL AIEML> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

<1390_o o 1380_903 TO 1526 OF TRN10TETR_0_o o 1350o 

190 200 210 220 

* * * * 

GAT AGG CAC CAT ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG 
DRHHTHFCPLEGESW> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); CO0ON_START=1 > 

b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

< o_1340_o o_903 TO 1526 OF TRN10TETR_o_1310_o o 
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230 240 250 260 270 

* « * * » 

CAA GAT TTT TTA CGT AAT AAC GCT AAA AGT TTT AGA TGT GCT TTA 

ODF LR NNAKSFRCAL> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

b b b TETR REPRESSOR MRNA (SPLIT]_b b b b > 

<1300_o o 1290.903 TO 1526 OF TRN10TETR_0_o o 1260a 



280 290 300 310 

* * * * 

CTA AGT CAT CGC GAT GGA GCA AAA GTA CAT TTA GGT ACA CGG CCT 
L SHRDGAKVHLGTRP 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C000N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLITj.b b b b > 

< o_1250__o O.903 TO 1526 OF TRN10TETR__o_1220_o o 



320 330 340 350 360 

• * * * * 

ACA GAA AAA CAG TAT GAA ACT CTC GAA AAT CAA TTA GCC TTT TTA 

TEKQY. ETLENQLAFL> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

__b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

<1210_o o 1200_903 TO 1526 OF TRN10TETR_0_a o 1170o 



370 380 390 400 

« » .* * 

TGC CAA CAA GGT TTT TCA CTA GAG AAT GCA TTA TAT GCA CTC AGC 

CQQGFSLENALYALS> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLITj.b b b b > 

< o_1160_o o_903 TO 1526 OF TRN10TETR o_1 130 o o 

410 420 430 440 450 

* * * * * 

GCT GTG GGG CAT TTT ACT TTA GGT TGC GTA TTG GAA GAT CAA GAG 
AVGHFTLGCVLEDQE 

TETRACYCLINE REPRESSOR PROTEIN (TETR); CO0ON_START=1 > 

b b b TETR REPRESSOR MRNA [SPLITj.u b b b > 

<1120_o o 1110.903 TO 1526 OF TRN10TETR.0.O o 1080o 
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460 470 480 490 

« * * » 

CAT CM GTC GCT AM GM GAA AGG GAA ACA CCT ACT ACT GAT AGT 
HOVAKEERETPTTDS 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

< o_1070_o o_903 TO 1526 OF TRN10TETR_o_1040_o o 

500 510 520 530 540 

• » * ♦ * 

ATG CCG CCA TTA TTA CGA CM GCT ATC GM TTA TTT GAT CAC CAA 
MPPLLRQAIELFDHQ> 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLIT]_b b b b > 

<1030_o o 1020.903 TO 1526 OF TRN10TETR_0_o o o990o 



550 560 570 580 

* * * * 

GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GM TTG ATC ATA TGC 
GAEPAFLFGLELIIO 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C0D0N_START=1 > 

b b b TETR REPRESSOR MRNA [SPLITj.b b b b > 

< o_980_o o_903 TO 1526 OF TRN10TETR o 950_o o 



590 600 610 620 

* ♦ * * 

GGA TTA GM MA CM CTT MA TGT GM AGT GGG TCT TM 
GLEKQLKCESGS 

TETRACYCLINE REPRESSOR PROTEIN (TETR); C000N > 

b b TETR REPRESSOR MRNA [SPLIT]_b b b > 

<940_o o 903 TO 1526 OF TRN10TETR 910.0 o 
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LT-1 

EcoR5 and EcoR I 

GATATCGAATTCATGAGTAGATTGGACAAGAGCAAAGTGATCAATAGTGC 

TCTGGAGCTGTTGAATGAAGTGGGCATAGAAGGTCTGACTACCAGAAAGC 

TGGCCCAGAAGCTGGGAGTGGAGCAGCCAACATTGTACTGGCATGTGAAG 

AATAAGAGGGCTCTGCTGG ATGCATT GG CGGTACC AGGC 

Nsil Kpnl 



LT-2 

Kpnl Nsil 



GCTCGGTACCTGGATGCATTGGCCATTGAGATGCTGGACAGACACCATAC 

ACACTTCTGCCCACTGGAAGGCGAGAGTTGGGAGGACTTCCTGAGGAACA 

ATGCTAAGAGTTTCAGATGTGCTCTGTTGAGCCACAGAGACGGTGCTAAA 

GTGC A C CTG G A ATTC G AGC 
ApaLl EcoR I 

LT-3 

EcoRl ApaLl 



GCTCGAATTCAAAGTGCACCTGGGTACAAGGCCAACAGAGAAACAGTACG 

AGACCCTGGAGAACCAGCTGGCATTTCTGTGCCAACAAGGCTTCAGCCTG 

GAGAATGCATTGTATGCTCTGAGTGCTGTGGGTCACTTCACACTGGGTTG 

TCTCCTGGAGGACCAGGAGCACCAGGTGGCCAAGGAGGAGAGGGAGACCC 

CAACCACTGACA GCATGC CCC GGATCCG AGC 

Sphl BanHI 

LT-5 

BamHl Sphl 

GCTCGGATCCACAGCATGCCCCCATTGCTGAGACAGGCCTATGAGCTGTT 
TGACCACCAAGGGGCAGAGCCTGCTTTTCTGTTTGGCCTGGAGCTCATCA 

TCTGTGGTCTGGAGAAGCAGCTGAAGTGTGAGAGTGGCTCCTG AAGGTTG 

A ^ ^ Hind3/EcoR5 
ATATC 
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GATATCGAAT 

TCAATAGTGC 

AGGTCTGACT 

GAGCAGCCAA 

CTCTGCTGGA 

CCATACACAC 

GACTTCCTGA 

TGTTGAGCCA 

AAGGCCAACA 

CTGGCATTTC 

CATTGTATGC 

TTGTGTCCTG 

GAGAGGGAGA 

TGAGACAGGC 

GCCTGCTTTT 

CTGGAGAAGC 

TGATATC 



TCATGAGTAG 

TCTGGAGCTG 

ACCAGAAAGC 

CATTGTACTG 

TGCATTGGCC 

TTCTGCCCAC 

GGAACAATGC 

C AG AG ACGGT 

GAGAAACAGT 

TGTGCCAACA 

TCTGAGTGCT 

GAGGACCAGG 

CCCCAACCAC 

GATAGAGCTG 

CTGTTTGGCC 

AGCTGAAGTG 



ATTGGACAAG 

TTGAATGAAG 

TGGCCCAGAA 

GCATGTGAAG 

ATTG AG ATGC 

TGGAAGGCGA 

TAAGAGTTTC 

GCTAAAGTGC 

ACGAGACCCT 

AGGCTTCAGC 

GTGGGTCACT 

AGCACCAGGT 

TGACAGCATG 

TTTGACCACC 

TGGAGCTCAT 

TGAGAGTGGC 



AGCAAAGTGA 

TGGGCATAGA 

GCTGGGAGTG 

A AT AAG AGGG 

TGGACAGACA 

GAGTTGGCAG 

AGATGTGCTC 

ACCTGGGTAC 

GGAGAACCAG 

CTGGAGAATG 

TCACACTGGG 

GGCCAAGGAG 

CCCCCATTGC 

AAGGGGCAGA 

CATCTGTGGT 

TCCTGAAGCT 
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